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ABSTRACT 

The  objective  of  this  research  was  to  investigate  the 
feasibility  of  introducing  computer  vision  methods  to  the 
building  construction  site.   The  primary  use  envisioned  is 
direct  input  of  data  from  a  video  medium  to  the  computer  for 
the  purpose  of  productivity  analysis. 

The  first  step  described  is  a  familiarization  with  a 
fundamental  computer  vision  system. 

Then,  the  results  of  processing  actual  footage  from 
construction  sites  and  other,  existing  buildings  are 
described.   There  are  observations  made  as  to  the  effect  of 
backlighting  of  the  structure,  lighting  conditions  and 
shadowing,  and  physical  obstructions.   These  concepts  are 
illustrated  by  digitized  images  of  the  structures  observed. 

Proposals  are  then  made  as  to  methods  to  insure  precise 
repeatable  placement  of  observation  cameras.   The  alternate 
proposal  is  translation  of  images  obtained  from  different 
camera  positions  through  the  use  of  on-screen  reference 
po  ints . 

Finally,  a  summary  of  the  physical  barriers  and 
technological  problems,  and  suggested  courses  of  action,  is 
provided . 
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CHAPTER  1 
INTRODUCTION  AND  BACKGROUND 


Traditional  Data  Acquisition 
on  the  Construction  Site 


Productivity  can  be  defined  as  the  ratio  of  input 
resource  units  to  output  units  (input/output).  In  the 
context  of  building  construction,  this  can  be  specified  as 
manhours-,  machine  hours-,  or  dollars-per-ton,  -lineal  foot, 
-square  foot,  or  -cubic  yard. 

Productivity  on  the  construction  site  is  currently 
determined  by  several  means.   The  simplest,  yet  least  useful 
for  productivity  improvement  programs,  is  after-the-fact 
comparison  of  total  resources  expended  versus  quantities  in 
place.   Several  more  useful  methods  involve  data  collection, 
on  the  job  site,  of  work  in  progress.   This  data  could  be  in 
the  form  of:   Time-lapse  photography;  Work.  Sampling;  or 
Manual  timing  of  operations. 

The  method  of  time-lapse  photography  involves  the  view- 
ing of  video  footage  (or  film)  of  construction  operations. 
Observations  are  then  made  as  to  inefficiencies  in  proce- 
dures, wasted  travel  time  to  locations  of  work  and  mater- 
ials, and  the  actual  production  rate  of  work  in  place. 

Work  sampling  involves  random  surveys  of  the  job  site 
to  observe  whether  "effective  work"  is  underway.   Effective 


work,  is  defined  as  those  activities  that  either  actually 
place  materials  in  the  structure  or  directly  support  such 
activities.   Ineffective  work  is  everything  else  (standing 
around,  drinking  a  soda,  or  transporting  materials  long 
distances).   By  gathering  sufficient  data  on  all  activities, 
a  picture  of  the  effectiveness  of  the  work  force  can  be 
drawn. 

Manual  timing  of  operations  involves  observers  on  the 
job  site  with  stopwatches  and  notepads  recording  actual 
performance  times  of  the  activities  of  concern.   This  method 
is  extremely  manpower  intensive. 

These  productivity  analysis  methods  all  involve  some 
statistical  analysis  of  the  job.   Statistical  sampling  is 
necessary  due  to  the  large  expenditure  of  time  and  capital 
involved  in  continuous  observation.   This  large  expenditure 
occurs  not  only  during  the  collection  of  the  data,  but  also 
during  the  required  analysis. 

The  collection  of  data  is  the  first  of  four  steps  in 
the  statistical  sampling  process.   The  remaining  steps  are: 
(2)  organization  of  data,  (3)  analysis  of  data  (determina- 
tion of  "descriptive  statistics"),  and  (4)  interpretation  of 
data  ("inferential  statistics")  (Dillman  1981,  pp.  7-8). 
Theoretically,  proper  evaluation  of  a  statistical  sample 
provides  reasonably  accurate  results. 

The  final  three  steps  listed  above  are  well  suited  for 
accomplishment  by  computer.   However,  manual  data  collection 


and  manual  input  to  the  computer  fail  to  utilize  the  full 
potential  of  the  technologies  being  developed  in  the 
manufacturing  industry. 


Historical  Development/System  Trend: 
in  Computer  Vision 


"Pattern  recognition"  can  be  defined  as  the  computer- 
ized process  of  analyzing  a  sensory  image  and  differentia- 
ting and  recognizing  the  component  parts  of  that  image. 
This  concept  is  widely  used  in  the  factory  environment  for 
quality  control  and  for  the  control  of  robotics. 

The  current  state-of-the-art  pattern  recognition  system 
requires  that  very  distinct  patterns  be  provided.   In  the 
factory  this  is  accomplished  by  separating  individual  items 
and  closely  scrutinizing  them  under  optimal  lighting  condi- 
tions.  This  is  impractical  on  the  construction  site,  where 
lighting  is  largely  uncontrollable  and  the  distances  to  the 
object  are  greater;  but  then  the  requirements  at  this  stage 
are  not  to  discern  small  individual  components  but  to  survey 
bulk,  quantities  of  production,  e.g.  square  feet  of  wall, 
feet  of  pavement  or  pipe,  etc.   These  differences  in  objec- 
tives and  physical  characteristics  necessitate  a  different 
class  of  optics  and  recognition  systems  than  those  used  in 
the  factory  environment. 


Qbiectivf??=;  of  Study 

The  objective  of  this  research  was  to  investigate  the 
feasibility  of  introducing  computer  vision  methods  to  the 
building  construction  site.   This  objective  was  identified 
as  a  primary  consideration  for  introducing  automated  data 
collection  for  productivity  analysis  of  the  construction 
process  (Thomas  and  Smith  1987). 

The  tasks  necessary  to  fulfill  this  objective  are: 

1.  Learn  the  capabilities  of  an  entry-level  computer 
vision  system  with  regard  to  its  ability  to 
reconstruct  a  given  image  with  varying  levels  of 
contrast. 

2.  Select  sample  structure(s)  for  data  collection. 

3.  Determine  the  feasible  angles  of  view  for 
observation  cameras,  and  their  related  distances, 
based  upon  the  nature  of  the  object  observed  and 
the  specifications  of  the  camera  system. 

4.  List  the  steps  involved  in  correlating  the 
dimensions  of  the  visual  image  processed  by  the 
computer  vision  system  with  the  actual  dimensions 
of  the  observed  object,  and  the  corresponding 
areas . 

5.  Identify  the  physical  and  technological  barriers  to 
effective  use  of  entry  level  vision  systems 
hardware  in  construction. 


Methods 

The  first  requirement  was  an  in-depth  literature  search 
and  review.   Publications  on  computer  vision  and  construc- 
tion management  were  reviewed  for  information  on  previous 
work  on  vision  systems  employed  for  productivity  data 
collection.   In  addition  to  published  literature,  course 
notes  from  the  Computer  Vision  and  Inspection  course  offered 
at  the  Pennsylvania  State  University  were  reviewed  as  a 
primer  on  vision  system  techniques  and  applications. 

In  conjunction  with  the  literature  search,  the  labora- 
tory exercises  for  the  aforementioned  computer  vision  course 
were  performed  to  familiarize  the  author  with  the  equipment. 

Field  data  collection  was  performed  using  a  Panasonic 
video  camera  and  recorder.   Time  of  day,  temperature, 
weather,  and  location  were  all  considered  prior  to  collec- 
tion of  data.   The  following  structures  were  considered  as 
potential  subjects:   Mid-State  Bank  (under  construction); 
Atherton  Hotel  (under  construction);  and  Centre  Community 
Hospital  (not  under  construction).   Upon  selection  of  Centre 
Community  Hospital  for  analysis,  as-built  drawings  were 
obtained  for  additional  information. 

Analysis  was  comprised  of  executing  pixel  counts  on  the 
images  collected.   These  images  were  then  qualitatively 
evaluated  for  indications  of  where  system  improvements  are 


required,  and  where  potential  technological  limitations 
exist. 

The  findings  of  the  study  enabled  the  author  to  deter- 
mine that,  although  feasible,  substantial  work  remains  prior 
to  prototype  applications. 


CHAPTER  2 
HARDWARE  AND  SOFTWARE 


Determination  of  PCEYE 
System  Capabilities 


The  computer  vision  system  utilized  for  this  study  is 
PCEYE,  marketed  by  Chorus  Data  Systems.   Its  unit  cost  is 
under  $1000,  exclusive  of  the  computer  and  camera  equipment. 
The  PCEYE  system  used  for  this  study,  as  available  in  the 
Computer  Vision  and  Inspection  Laboratory  (CVIL)  of  the 
Mechanical  Engineering  Department  of  The  Pennsylvania  State 
University,  includes  the  following  components: 

-IBM  Personal  Computer  with  256K  RAM,  dual  360K  drives 

-Color  Monitor,  using  Color  Graphics  Adapter 

-PCEYE  System  Board 

-Black,  and  White  Video  Camera 

-Black  and  White  Video  Monitor 

-Video  Cassette  Recorder  (VCR) 

-Epson  Dot  Matrix  Printer 
All  laboratory  work  (e.g.  image  processing)  was  accomplished 
in  CVIL  using  this  equipment. 

Input  Characteristics 

The  PCEYE  system  accepts  as  input  the  standard  analog 
RS-170  signal,  in  common  use  for  black  and  white  video 
transmission,  or  the  National  Television  Systems  Committee 
(NTSC)  signal,  which  carries  the  information  for  color 
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images  in  addition  to  the  RS-170  data.   The  NTSC  signal  is 

common  to  television  and  VCR  transmissions.   The  RS-170 
signal  is  illustrated  in  Figure  2.1.   This  signal  causes  the 
scanning  beam  in  a  television  picture  tube  to  scan  across 
the  tube  in  consecutive  horizontal  lines  from  top  to  bottom. 
It  does  ifhis  thirty  times  each  second. 

This  continuous  function  (signal)  is  translated  into  a 
discrete  (digitized)  function  by  an  analog-to-digital 
converter  on  the  PCEYE  system  board. 

The  PCEYE  system  is  also  capable  of  retrieving  images 
which  were  previously  saved  to  disk.   These  disk-saved 
images  are  currently  what  must  be  used  for  analysis  and 
comparison. 

The  laboratory  setup  used  for  course  work  typically 
feeds  the  signal  directly  from  the  black  and  white  camera 
through  the  video  monitor  to  the  PCEYE  board.   For  the  pur- 
poses of  this  study,  it  was  necessary,  and  in  fact  desir- 
able, to  record  the  images  on  videotape  at  the  building 
site,  and  replay  them  on  a  VCR  in  the  laboratory  for  input 
to  the  system.   This  was  desirable  because  it  more  precisely 
matches  the  probable  procedure  that  will  be  used  in 
construction  practice. 

Resolution 

Fineness(Size.Crispness) .   The  PCEYE  system,  as  con- 
figured, breaks  the  image  into  320  picture  elements 
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RS-170  SIGNAL  (Carlson  and  Gisser,  1981) 

FIGURE  2. 1 
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(pixels)  horizontally  by  200  pixels  vertically.   Using  the 

example  of  a  building  100  feet  tall  which  fills  the  screen 
vertically,  each  vertical  pixel  then  equates  to  six  inches. 

The  resolution  of  the  320  by  200  system  is  the  minimum 
recommended  for  use.  Figure  2.2(a)  is  the  upper  left  64  by 
64  image  extracted  from  the  PCEYE  image  in  Figure  2.2(b). 

Intensity .   The  existing  system  uses  four  grey  levels 
to  represent  a  digitized  image.   These  four  grey  levels,  or 
light  intensities,  are  displayed  as  four  pseudo  colors  on 
the  color  monitor.   In  ascending  order  they  are:   black, 
blue  (cyan),  red  (magenta),  and  white. 

The  user  can  adjust  the  sensitivity  of  the  digitizer 
until  the  image  observed  meets  the  desired  intensity  levels, 
the  resultant  image  could  be  black,  and  white,  or  four  color, 
whichever  brings  out  the  desired  details.   The  sensitivity 
is  actually  assigned  by  directing  the  lowest  intensity  that 
is  interpreted  as  white,  and  the  highest  intensity  that  is 
interpreted  as  black.   The  system  then  distributes  the  other 
two  grey  levels  between  these  two  limits. 

Image  Processing 

Color  versus  Black  and  White.   There  is  usually  no 
problem  inputing  a  color  signal  to  a  simple  frame  grabber 
such  as  PCEYE,  since  the  color  signal  usually  contains  the 
same  information  in  the  same  format  as  a  black  and  white 
signal.   Any  problem  that  arises  is  based  upon  the  limi- 
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tation  that  such  a  frame  grabber  is  designed  solely  to 

handle  black  and  white  images.   The  extraneous  color 
information  can  be  misinterpreted,  causing  interference 
and/or  a  fuzzying  of  the  image;  therefore  the  color  portion 
of  the  signal  is  normally  filtered  out.   Systems  that  make 
use  of  the  color  signal  are  under  development.   What  will  be 
found  in  the  initial  development  of  color  systems  is  to  a 
greater  degree  what  was  found  in  grey  level  systems.   The 
more  grey  levels  there  are,  the  more  complex  and  slow  the 
software  becomes.   The  distinct  advantages  arise  in  that  the 
system  becomes  more  tolerant  to  variations  in  lighting,  thus 
requiring  less  special  lighting.   For  example,  backlighting 
is  usually  required  for  a  black  and  white,  edge  detecting 
system. 

For  the  system  utilized  in  this  study,  the  color  of  the 
object  only  plays  a  role  in  the  development  of  an  image 
inasmuch  as  the  color  can  create  a  differentiation  in  the 
intensity  of  the  light  reflected  by  that  object. 

Volume  of  individual  grev  levels.   The  PCEYE  system 
package  contains  two  programs  which  process  the  digitized 
image  and  report  the  number  of  pixels  at  each  grey  level. 
The  images  processed  by  these  programs  must  have  been 
previously  saved  to  disk  in  a  format  different  from  the 
standard  PCEYE  format  (see  CAPTURE. BAS  below). 

The  first  program  processes  the  entire  image  (frame), 
which  takes  two  to  three  minutes.   The  product  is  a  histo- 
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gram  showing  graphically  the  number  of  pixels  per  grey 
level . 

The  second  program  will  process  any  rectangular  portion 
of  the  image  designated  by  the  user,  up  to  the  full  screen. 
This  can  take  over  an  hour,  but  more  detailed  information  is 
provided.   The  user  designates  what  "colors"  are  part  of 
the  object,  and  is  told  the  size  of  the  object,  its  centroid, 
and  if  there  are  any  "holes"  in  it  (visually).   This  program 
was  developed  at  the  Pennsylvania  State  University  and  is 
not  available  with  PCEYE. 

Image  Enhancement.   Using  this  system,  the  only  prac- 
tical method  of  enhancing  the  image  is  proper  adjustment  of 
the  black,  and  white  levels.   These  levels  determine  the 
upper  limit  of  light  intensity  that  will  be  interpreted  as 
black  and  the  lower  limit  interpreted  as  white.   The  user 
sets  the  limits  by  either  entering  the  levels  into  the 
parameter  table  available  in  the  PCEYE  program,  or  by 
adjusting  the  image  as  it  appears  on  the  screen. 

Output 

By  definition,  the  hardware  has  one  output:   digitized 
images.   The  software  processes  these  images,  which  can  be 
viewed  on  a  monitor,  printed  on  paper,  or  saved  on  disk. 
The  value  of  the  system  lies  in  the  various  pixel  counting 
and  image  comparison  programs. 
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PCEYEIO  is  the  PCEYE  program  (in  BASIC)  whose  main 
utility  is  to  "grab"  the  analog  signal,  then  display  and/or 
save  the  resultant  digitized  images.   DIFFPGM,  SHOWPIC,  and 
PRNPICS  use  the  images  saved  by  PCEYEIO. 

DIFFPGM.EXE  compares  two  PCEYE  images  pixel  by  pixel. 
It  then  reports  the  number  and  percentage  of  pixels  that 
do  not  match.   The  sensitivity  of  the  program  is  set  by  the 
user  during  each  run,  meaning  that  the  user  may  choose  the 
range  of  grey  levels  that  can  cancel  each  other.   For 
example,  choosing  zero  for  the  sensitivity  means  only  exact 
matches  cancel  one    another  (blue  cancels  only  blue).   A 
sensitivity  of  one  means  pixels  within  one  level  of  each 
other  will  cancel  (blue  could  now  also  cancel  black  and 
red).   Obviously,  zero  should  be  the  only  feasible  setting 
for  this  system,  since  there  can  be  a  great  difference 
between  the  extremes  of  light  intensity  of  adjacent  grey 
levels  in  this  system.   Figure  2.3  illustrates  the  relative 
error  of  a  four  grey  level  system  to  a  sixteen  grey  level 
system  when  the  sensitivity  is  set  to  one. 

This  assumption  of  the  required  sensitivity  was  tested 
in  the  laboratory.   From  testing,  it  was  found  that  using 
zero  resulted  in  a  fairly  constant  mismatch  of  approximately 
two  percent  of  the  pixels  in  any  two  images:   1)  grabbed 
close  together  in  the  laboratory;  2)  under  artificial 
lights;  and  3)  directly  from  the  camera.   This  error  is 
minor  and  consistent,  and  therefore  of  little  concern.   Then 
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the  step  of  recording  these  same  images  onto  videotape  was 
introduced.   When  the  images  were  replayed  to  the  system, 
the  standard  error  increased  to  approximately  3.8  percent. 
Again,  though  the  error  nearly  doubled,  it  is  relatively 
minor  and  consistent. 

When  images  obtained  in  the  field  were  subjected  to  the 
same  test,  the  errors  were  in  the  fifteen  percent  range. 
This  error  is  no  longer  acceptable;  it  is  evident  that  the 
difference  is  the  result  of  the  uncontrolled  characteristics 
of  the  natural  lighting  and  is  therefore  not  easily  predict- 
able.  The  change  in  lighting  appears  to  vary  beyond  the 
bounds  of  one  PCEYE  grey  level.   A  solution  might  be  to  use 
one  instead  of  zero  for  the  sensitivity.   As  discussed 
above,  such  a  decrease  in  the  sensitivity  yields  results 
which  can  be  too  gross  for  realistic  analysis.   This  problem 
will  be  resolvable  upon  introductions  of  more  grey  levels, 
where  sensitivity  can  be  more  finely  tuned. 

This  may  be  a  moot  point,  as  the  pixel  for  pixel 
matching  is  effective  only  where  the  two  images  can  be 
absolutely  overlaid.   The  significance  of  this  constraint 
will  be  discussed  in  Chapters  3  and  4. 

SHOWPIC.EXE  and  PRNPICS.EXE  exist  to  display  previously 
saved  PCEYE  images  on  the  screen,  and  to  print  such  images, 
respectively.   They  serve  no  direct  analytical  purpose; 
however,  review  of  hard-copy  images  can  bring  insight  to  the 
patterns  assigned  by  the  system.   This  is  demonstrated  in 
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Figure  2.4,  where  portions  of  the  dark  brown  aluminum  panels 
present  a  pattern  similar  to  the  buff  brick. 

HISTO  takes  the  complete  screen  image  and  produces  a 
histogram  of  the  number  of  pixels  in  each  grey  level.   When 
counting  the  overall  number  of  pixels  per  grey  level  in  an 
image,  the  need  for  exact  alignment  is  relaxed,  but  camera 
coverage  still  must  be  approximately  the  same.   This  is  in 
contrast  to  the  exact  overlay  required  when  doing  the  point- 
by-point  comparison  described  above.   In  this  case,  indivi- 
dual pixels  may  still  have  the  fluctuation  noted  while  using 
DIFFPGM,  but  by  comparing  the  whole  field  of  pixels,  the 
variations  of  individual  pixels  should  tend  to  balance. 

MEDAOI,  developed  at  the  Pennsylvania  State  University, 
lists  the  number  of  pixels  in  each  grey  level  of  an 
image,  or  any  rectangular  portion  thereof;  the  user  defines 
the  upper  left  and  lower  right  corners.   The  capability  of 
this  program  to  take  any  portion  of  the  image  is  a  distinct 
advantage,  as  areas  external  to  the  object  can  be  reduced, 
but  it  is  extraordinarily  slow. 

Videotape  Equipment  &  Recording  Media 

There  is  a  great  range  of  equipment  available  on  the 
market  today  for  recording  video  images.   The  quality  of 
images  produced  can  usually  be  correlated  to  the  expense  of 
the  system;  therefore,  it  will  be  important  to  determine  the 
minimum  image  quality  necessary  for  processing. 
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Camera  Features:   Advantages  and  Disadvantages 

Tube  Tvpe  (Vidicon)  versus  Solid  State  ( CCD ) .   There 
are  two  general  video  camera  types  that  are  available  to  the 
retail  purchaser.   These  are  illustrated  in  Figure  2.5. 

The  first  is  the  older,  tube  type  camera,  also  known  as 
"vidicon."   This  now  conventional  device  captures  the  visual 
image  by  passing  an  electron  beam  across  a  reactive  screen. 
This  produces  an  analog  signal  which  is  combined  with  a 
synchronizing  signal  within  the  camera,  and  is  output  as  the 
RS-170  (or  NTSC)  signal.   This  type  of  camera  is  currently 
more  prevalent  due  to  its  longer  time  on  the  market. 

Although  this  is  the  type  of  camera  used  in  this  re- 
search (because  it  was  available),  it  is  not  preferable  for 
use  on  the  construction  site;  the  tube  used  in  these  cameras 
is  fragile,  susceptible  to  rough  handling  and  environmental 
extremes,  and  is  subject  to  "burn-in"  (physical  damage  to 
the  reactive  screen)  if  directed  at  a  light  source,  such  as 
a  window  reflecting  the  sun  or  a  welder  at  work. 

The  second,  newer  type,  the  solid  state  camera,  is  also 
known  as  a  charge-coupled  device  or  "CCD".   This  type  of 
camera  also  has  a  screen  onto  which  the  image  is  cast, 
although  the  screen  here  is  an  array  of  "wells"  on  the  sur- 
face of  a  semiconductor  chip.   The  array  is  electronically 
scanned,  and  produces  a  digital  signal.   Unfortunately,  this 
digital  signal  is  then  converted  to  analog  within  the  camera 
in  order  to  match  the  current  standard  (RS-170)  for  record- 
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ing  on  standard  videotape  and  for  viewing  on  a  standard 
television  or  monitor.   Also,  as  discussed  above,  this 
convention  has  necessitated  the  use  of  an  analog-to-digital 
converter  in  any  computer  vision  system. 

The  advantages  of  the  solid  state  camera  are  its 
overall  .hardiness  over  the  tube  type,  and  its  increasing 
availability.   The  solid  state  components  are  not  as  suscep- 
tible as  the  tube  to  environmental  conditions  or  burn-in. 
Manufacturers  have  almost  phased  out  the  tube  type  camera  in 
favor  of  the  solid  state  cameras  for  retail  sales. 

Zoom  lens  capabilities.   Use  of  a  fixed  focal  length 
lens  greatly  limits  the  flexibility  of  any  data  collection 
system  in  that  it  creates  an  extremely  narrow  range  envelope 
from  which  the  required  data  can  be  collected.   A  zoom  lens 
can  increase  that  envelope  practically  indefinitely. 

As  an  example  of  the  flexibility  that  a  zoom  lens  pro- 
vides, the  camera  used  in  this  research  has  an  eight-to-one 
zoom  (10.5-84  mm),  giving "the  user  a  large  "envelope"  within 
which  to  set  up.   In  the  case  of  a  building  100  feet  tall, 
for  example,  the  usable  distance  for  total  coverage  of  the 
building  ranges  from  about  150  feet  to  1200  feet. 

This  topic  will  be  discussed  further  in  the  treatment 
of  camera  placement. 

Automatic  and  manual  light  level  adjustments.   Most 
current  video  cameras  have  the  capability  for  automatic 
light  level  control.   Most  of  these  allow  the  user  to  switch 
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to  manual  control.   There  are  distinct  advantages  and  disad- 
vantages to  each  setting. 

The  automatic  light  level  adjustment  feature  allows  the 
camera  to  adjust  the  iris  to  meet  existing  light  levels  in 
the  environment.   This  allows  the  overall  lighting  levels  to 
change  and  produce  no  significant  difference  in  the 
perceived  image  as  captured  by  the  camera.   The  major 
problem  with  the  automatic  adjustment  occurs  when  there  is  a 
"hot  spot"  within  the  field  of  view,  or  even  a  bright  object 
near  the  camera  and  not  in  the  picture.   The  former  could  be 
a  highly  reflective  surface,  such  as  a  window,  or  a  welder 
at  work;  the  latter  a  car  in  the  foreground  (as  happened 
during  the  study).   Either  causes  the  camera  to  close  down 
its  aperture  because  of  the  perceived  light  increase,  and 
causes  the  recorded  image  to  be  dark. 

The  manual  adjustment  of  the  light  level  allows  the 
user  to  ensure  that  the  light  reflected  from  the  surface  of 
interest  remains  fairly  constant.   This  requires  the  overall 
lighting  remain  constant.   If  the  period  of  observation  is 
short,  and  assuming  natural  light  and  a  relatively  clear 
day,  little  change  in  overall  lighting  is  likely.  In  this 
ideal  situation  it  is  a  simple  matter  to  use  the  fixed, 
manually  set  iris  found  on  any  standard  camera.   In  the  more 
common  situation,  it  will  be  necessary  to  determine 
independently  from  the  camera  what  the  actual  ambient 
lighting  level  is,  through  the  use  of  a  properly  shielded 
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and  directed  light  meter,  and  thereby  set  the  proper  iris 
manually  or  electronically.  The  latter  system  would  simply 
be  a  customized  automatic  light  level  adjustment,  but  would 
overcome  the  problems  inherent  in  the  standard  through-the- 
lens  system.  Again,  as  discussed  previously,  minor  changes 
in  lighting  can  be  compensated  by  reducing  the  sensitivity 
on  a  better  grey  level  system. 

Videotape  Characteristics 

Video  cassette  recorders  (VCRs)  are  designed  to  produce 
a  color  video  signal  which,  while  carrying  the  same  informa- 
tion as  the  standard  RS-170  signal,  also  carries  the  extra 
information  that  is  used  to  produce  a  color  image.   As  noted 
above,  most  frame  grabbers  are  equipped  to  filter  out  the 
extra  information. 

Resolution.   The  resolution  of  a  videotaped  image 
appears  to  be  comparable  to  the  direct  input  of  the  camera. 
The  experimental  results  described  above,  using  the  compari- 
son program,  would  seem  to  support  this  assumption.   The 
variations  in  the  videotaped  image  that  were  imperceptible 
to  the  human  eye  were  picked  up  by  the  PCEYE  system,  but 
were  acceptable. 

Distortion.   Another  inherent  problem  with  using  the 
videotape  medium  is  the  susceptibility  of  the  tape  to  wear, 
causing  degradation  of  the  signal  output,  and  the  mechanical 
wear  of  the  VCR  mechanism,  causing  an  increase  of  line 
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noise.   Such  degradation  and  interference  can  be  controlled 
by  limited  reuse  of  videotape,  and  proper  maintenance 
(especially  head  cleaning)  of  equipment. 
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CHAPTER  3 
POSITION  OF  CAMERA  -  SITE  TESTING 

Nature  of  Observed  Object 

The  nature  of  the  observed  object  is  critical.   Its  na- 
ture is  its  general  shape,  dimensions,  finishes,  edges, 
depth,  and  visual  accessibility  ( unobscured ) . 

Humans  can  observe  an  object,  and  subconsciously  "fill 
in  the  gaps."   The  computer  cannot  do  this,  so  it  is 
necessary  for  the  entire  image  to  be  captured  by  the  camera 
or  other  equipment  in  use.   The  ability  to  capture  that 
entire  image  is  directly  related  to  and  restricted  by  the 
capabilities  of  the  equipment  in  use.   In  this  specific 
case,  the  equipment  is  a  standard  video  camera  with  a  zoom 
lens.   The  zoom  lens  allows  great  flexibility  in  locating 
the  camera  so  the  entire  image  can  be  captured  and  the 
effects  of  distortion  caused  by  being  too  close  to  a  large 
object  can  be  minimized.   It  can  also  introduce  other 
errors,  as  will  be  discussed  later  in  this  chapter. 

Another  factor  which  will  come  into  play  is  the  nature 
and  orientation  of  lighting.   People  have  a  much  more  con- 
tinuous discernment  of  light  levels  than  the  computer,  and 
are  therefore  more  tolerant  of  poor  lighting  conditions. 
Where  these  poor  lighting  conditions  exist,  for  instance 
casting  dark:  shadows,  proper  analysis  will  be  difficult. 
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Volume.,  Area,  or  Length 

A  vertical  area,  such  as  a  wall,  provides  the  greatest 
ease  of  observation.   The  reason  for  this  is  the  great  flex- 
ibility available  in  camera  setup  positions.   As  will  be 
demonstrated,  the  error  incurred  by  varying  the  position  of 
the  camera,  within  the  operational  envelope,  is  minimal  and 
easy  to  calculate  (see  Chapter  4  for  errors  introduced  by 
camera  position) .   This  is  also  true  for  the  error  intro- 
duced by  the  distance  differences  of  the  extremes  of  the 
object.   Furthermore,  as  will  be  discussed  in  Chapter  4,  it 
is  a  simple  matter  to  make  a  direct  correlation  between  the 
number  of  pixels  in  the  digitized  image  and  the  actual  area 
observed . 

These  statements  concerning  vertical  areas  can  be 
adapted^  to  horizontal  areas,  provided  that  the  camera  can  be 
positioned  within  the  corresponding  operational  envelope. 

The  system  may  be  applied  very  simply  to  calculate 
lengths  by  establishing  the  endpoints  of  the  object.   Or,  if 
the  width  of  the  object  is  known,  the  area  calculations  can 
be  manipulated  to  yield  the  width. 

The  most  complex  feature  to  observe  is  a  volume.   It  is 
probable  that  proper  analysis  of  a  volume  is  beyond  the 
capability  of  a  pattern  recognition  system  of  the  nature  now 
under  consideration.   Volume  calculations  are  most  often 
associated  with  earthmoving  operations,  and  are  therefore 
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probably  in  the  realm  of  photogrammetry,  at  least  where  the 
limitation  is  to  visual  clues. 

Planar  or  Curved  or  Noncoplanar 

While  actual  observation  of  a  complex  geometry  is  no 
more  difficult  than  that  of  a  simple  plane,  translation  of 
such  an  image  into  useful  data  for  measurement  of  area  or 
productivity,  is.   The  simplest  case  of  a  planar  object 
allows  direct  translation  of  the  digitized  image  to  actual 
area.   When  the  object  is  not  planar,  the  user  must  break 
the  image  into  segments  that  the  computer  will  handle 
individually . 

Identification/Description  of 
Selected  Structures  for  Study 

Initial  observations  were  collected  at  construction 

sites  in  State  College,  Pennsylvania.   These  included  a 

small  office  building  and  a  hotel.   It  was  determined  that 

these  construction  sites  introduced  too  many  variables  at 

this  stage  of  study.   However,  the  images  obtained  clearly 

illustrated  the  potential  and  the  problems  of  using  a  vision 

system.   Figure  3.1  is  a  PCEYE  image  showing  a  portion  of  a 

small  office  building  under  construction.   It  was  possible 

to  see  through  to  the  other  side  of  the  building;  this  is 

reflected  in  Figure  3.1  where  the  scaffolding  on  the  back,  of 

the  building  appears  in  the  image. 
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Figure  3.2  shows  images  developed  from  the  Atherton 
Hotel,  State  College,  at  different  stages  of  its  construc- 
tion.  It  can  be  plainly  seen  that  a  simple  vision  system 
here  has  the  capability  to  detect  differences  in  the  areas 
of  work  in  place.   In  Figure  3.2(a),  there  are  large  areas 
of  wall  that  show  bare  studs,  with  only  the  windows 
installed.   The  jagged  appearance  of  the  roofline  in  this 
figure  is  the  result  of  plastic  sheeting,  draped  over  the 
edge  of  the  roof  to  keep  out  the  rain,  blowing  in  the  wind. 
Figure  3.2(b)  clearly  shows  that  the  sheathing  has  now  been 
installed,  and  the  plastic  sheeting  has  been  removed. 

The  southwest  facade  of  Centre  Community  Hospital, 
State  College,  Pennsylvania,  was  finally  chosen  for  this 
study.   This  subject  is  of  the  simplest  nature,  as  described 
above,  in  that  it  is  planar,  for  the  most  part,  and  is 
relatively  unobscured  by  extraneous  materials,  such  as 
foliage,  plastic  sheeting,  scaffolding,  etc.   In  addition, 
it  provides  a  limited  variety  of  construction  materials  for 
observation. 

Nature  &  Condition  of  Illumination 

Natural  Light ( Sunl ight ) 

Needless  to  say,  the  position,  intensity,  color,  and, 
in  fact,  existence  of  natural  light  are  completely  uncon- 
trollable.  These  characteristics  can  vary  greatly  from  day 
to  day,  and  will  over  periods  of  months. 
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ATHERTON  HOTEL  UNDER  CONSTRUCTION  SHOWING 
(a)  BARE  STUD  WALLS  AND  PLASTIC  SHEETING 
AND  (b)  CLOSED  IN  WALLS 
FIGURE  3.2 
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All  observations  for  this  project  were  taken  under  con- 
ditions of  natural  lighting.   As  stated,  the  facade  under 
consideration  has  a  southwest  exposure.   For  reference,  the 
observations  were  taken  in  mid-January. 

Time  of  Day.   Observations  were  taken  at  three  times  on 
the  same  day:   9:00  a.m.,  12:15  p.m.,  and  3:30  p.m.   These 
times  were  chosen  to  center  the  observations  between  local 
sunrise  and  sunset.   The  9  a.m.  footage  shows  a  distinct 
shadow  line  across  the  facade  not  evident  in  either  of  the 
other  observations.   This  shadow  line  moves  noticeably  in 
the  short  (approximately  fifteen  minute)  period  of  filming. 
As  foreseen  above,  the  computer  cannot  discern  the  continu- 
ation of  the  wall  into  the  shadow  line;  this  could  introduce 
a  large  error  over  the  time  span  when  the  shadows  move 
rapidly,  i.e.  early  morning  and  late  afternoon. 

Weather .   On  the  day  of  observation,  the  weather  was 
officially  described  as  partly  cloudy,  but  was  subjectively 
judged  to  be  a  very  clear  day.   Temperatures  ranged  from  the 
low  teens  to  the  mid-thirties  (degrees  Fahrenheit)  through 
the  day.   These  temperatures  were  below  the  recommended 
operational  ranges  of  the  videotape  equipment,  and  although 
performance  was  satisfactory,  the  rechargeable  batteries  for 
the  system  required  frequent  changing. 

Orientation.   The  orientation  of  the  sun  varies  greatly 
between  the  three  observation  periods.   The  three  times  of 
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observation  were  chosen  intentionally  to  demonstrate  the 
differences  introduced  by  the  changed  angle  of  incidence  of 
the  sunlight  upon  the  building  face.   The  specifics  of  how 
this  orientation  will  change  is  also  dependent  upon  the  time 
of  year. 

As  can  be  seen  in  the  morning  image  (Figure  3.3),  the 
sun  was  low  and  to  the  southeast  of  the  building.   The  adja- 
cent hospital  wing,  on  the  right,  is  casting  the  large 
shadow  across  the  observed  face.   The  noon  observation 
(Figure  3.4)  placed  the  sun  almost  directly  behind  the 
observation  camera.   This  caused  the  reflection  from  the 
buff  brick,  surfaces  to  saturate  the  image,  reducing  the 
contrast.   The  late  afternoon  image  (Figure  3.5)  shows  more 
contrast  than  the  noon  image,  but  the  nature  of  the 
illumination  is  also  different  from  that  observed  in  the 
morning.   The  overall  light  level  is  higher  than  in  the 
morning  image.   Additional  images  from  the  three  observation 
periods  can  be  found  in  Appendix  A. 

Artificial  Light. 

The  use  of  artificial  light  has  both  advantages  and 
disadvantages.   Its  primary  advantages  are:   the  light  level 
is  constant  and  controllable;  the  orientation  of  the  light 
source  is  fixed  and  controllable;  and  additional  lighting 
can  be  added  to  minimize  shadowed  areas.   The  primary 
disadvantages  are:   artificial  lighting  is  expensive;  and  it 
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NINE    A.M.     IMAGE    OF    CENTRE    COMMUNITY    HOSPITAL 
SHOWING    SHARP    SHADOWLINE 
FIGURE    3. 3 
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NOON  IMAGE  OF  CENTRE  COMMUNITY  HOSPITAL 

FIGURE  3.4 
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AFTERNOON  IMAGE  OF  CENTRE  COMMUNITY  HOSPITAL 

FIGURE  3.6 
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can  cause  harsh  shadow  lines  which  the  computer  cannot  "see" 
through. 

Orientation  with  respect  to  object.   Control  of  the 
orientation  of  artificial  lighting  was  cited  as  a  primary 
advantage  in  the  previous  section.   This,  of  course,  is  in 
contrast  to  natural  light  variations  and  the  weather. 

There  are  three  overall  orientations  which  can  be 
addressed  for  the  illumination  of  a  building  exterior.   The 
first  two  involve  lighting  attached  to  the  building  itself 
and  directed  either  up  from  the  bottom,  or  down  from  the 
top.   This  scheme  of  lighting  is  acceptable  in  a  constructed 
facility  where  the  sole  purpose  is  to  show  the  building. 
However,  in  a  building  under  construction,  the  purpose  of 
illumination  is  to  enable  the  craftsman  to  carry  out  a  task. 
Lighting  in  the  first  configuration  will  likely  restrict 
access  to  the  structure.   In  the  second  configuration,  the 
light  will  likely  be  in  the  workers'  eyes.   In  either  case, 
the  lighting  is  not  an  aide  but  a  potential  safety  hazard. 

The  third,  and  preferable,  configuration  is  for  the 
lighting  equipment  to  be  located  some  safe  distance  away, 
and  the  light  directed  back  onto  the  structure.   The 
distances  used  are  obviously  dependent  upon  the  size  of  the 
building,  the  site  conditions,  and  the  capabilities  of  the 
equipment.   Also,  the  lighting  should  be  placed  at  whatever 
height  best  illuminates  the  areas  of  interest,  as  is 
physically  possible. 
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The  artificial  illumination  used  to  see  the  exterior  of 
the  building  is  not  the  only  case  of  concern  in  capturing 
images.   As  can  be  seen  in  the  prints  of  the  images  (Figures 
3.3,  3.4,  3.5,  and  Appendix  A),  lighting  inside  the  building 
drastically  changed  through  the  course  of  the  day.   In  the 
observed  case,  the  changing  interior  lighting  was  the  result 
of  light-colored  curtains  in  the  windows  being  opened  and 
closed,  and  reflecting  the  sunlight.   This  is  an  excellent 
illustration  of  the  corresponding  effect  that  interior 
lights  on  a  construction  site  would  have  on  the  image.   In 
the  worst  case,  it  would  adversely  effect  the  overall 
lighting  level,  and  degrade  the  usefulness  of  the  image  for 
comparison.   Much  the  same  effect  can  be  observed  in  the 
situation  where  unshielded  welding  is  ongoing. 

Another  form  of  "artificial"  light  which  should  be  con- 
sidered is  backlighting.   If  the  building  is  at  a  stage 
where  you  can  see  all  the  way  through  it,  the  light  coming 
through  is  indistinguishable  from  the  light  reflected  from 
the  structure.   This  was  clearly  illustrated  in  Figure  3.1. 

Interference  and/or  Combination  with  Available  Natural 
Light.   The  relationship  between  natural  and  artificial 
light  could  be  most  beneficial  when  the  artificial  is  used 
to  fill  in  the  gaps,  or  shadows,  left  by  the  natural.   In 
bright  daylight  this  is  unrealistic,  but  in  the  early 
morning  or  evening,  or  in  heavily  overcast  conditions,  this 
could  be  of  great  benefit.   The  gap  in  this  hypothesis  is 
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that  these  periods  already  require  artificial  lighting  due 
to  the  low  natural  lighting  level. 

Camera  Focusing  Precision 

As  described  earlier,  the  differentiation  program 
showed  a  regular  error,  even  when  observations  were  taken 
very  close  together,  the  camera  was  not  moved,  and  even  when 
the  observations  were  taken  under  artificial  lights  in  the 
controlled  setting  of  the  laboratory.   The  error  was  larger, 
but  still  fairly  consistent,  when  the  additional  variable  of 
natural  light  in  the  field  was  introduced.   The  basic  error 
level  may  be  the  result  of  inconsistency  by  PCEYE  when 
assigning  the  grey  scale  values  to  an  image.   We  cannot  pre- 
dict this  error  ahead  of  time,  but  we  can  determine  it  under 
each  new  set  of  field  conditions,  and  account  for  it  in  our 
calculations . 

The  problem  in  using  a  differentiation  type  program  is 
that  the  compared  images  must  be  matched  precisely.   This 
known  as  image  registration.   Using  standard  video  equip- 
ment, daily  camera  setups  are  impossible  to  duplicate.   The 
actual  precise  placing  of  a  standard  video  camera  is  as 
simple  as  using  a  plumb  bob;  the  key  for  this  application  is 
directing  the  camera  so  as  to  precisely  duplicate  the 
captured  image.   For  this  exact  duplication  to  be  achieved, 
it  will  be  necessary  to  use  methods  of  precision  optical 
alignment,  common  in  surveying;  this  must  be  done  through 
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the  lens  or  through  other  optics  attached  to  the  camera,  and 
referenced  to  a  fixed  object.   From  that  reference,  it  will' 
be  necessary  to  have  the  capability  of  rotating  the  camera 
in  the  vertical  and  horizontal  planes  to  the  desired 
position  (as  with  a  transit  or  theodolite).   This  will 
likely  entail  permanent  modification  and  customization  of 
equipment,  as  the  standard  through-the-lens  image  lacks  the 
precision  necessary. 

Range  Taraetable  at  One  Time  (Depth  of  Field) 

The  primary  limitation  evident  in  the  depth  of  field 
variable  of  the  video  camera  is  the  randomness  with  which 
the  camera  focused  on  the  object  under  wide  angle  view.   For 
example,  at  distances  up  to  300  feet,  the  automatic  focusing 
mechanism  systematically  focused  at  three  to  six  feet, 
instead  of  the  "infinity"  range  expected.   The  visual  image 
does  not  suffer  appreciably,  but  the  actual  result  is  much 
more  homogeneous  to  the  computer.   This  is  acceptable  for 
gross  estimates  of  area,  or  for  simple  counting,  using  the 
current  system.   This  would  not  be  acceptable  for  work 
requiring  higher  precision,  such  as  applications  in  quality 
control . 

Proximity  to  Object  Relative  to  Object  Size 

The  immediate  result  of  collecting  data  at  close  range 
to  a  large  (tall  or  long)  object  is  the  familiar  distortion 
of  a  tall  building  narrowing  at  the  top,  or  the  roadway 
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narrowing  in  the  distance.   This  distortion  decreases  as  the 
relative  distance  from  the  object  increases. 

The  origin  of  this  distortion  is  the  nature  of  the 
"field  of  view"  of  the  viewing  device,  this  being  the  human 
eye  or  the  camera  lens.   Field  of  view  is  expressed  in  terms 
of  an  angle.   When  this  angle  is  a  constant,  as  it  is  for 
any  one  focal  length  of  a  camera,  or  our  eyes,  the  actual 
area  covered  by  the  field  of  view  is  dependent  on  the 
distance  of  the  object  from  the  observer.   As  the  distance 
increases,  the  actual  width  or  height  of  the  field  of  view 
increases  proportionally. 

In  the  present  case,  for  illustration,  we  can  assume 
that  the  object  has  constant  width.   As  the  distance  to  the 
object  increases,  the  object  fills  less  of  the  field  of 
view.   The  observer  perceives  this  as  the  object  becoming 
narrower  at  the  top  or  in  the  distance.   The  greater  the 
difference  between  the  near  and  far  points  on  the  object, 
the  greater  the  distortion.   The  geometry  of  this  situation 
is  illustrated  and  analyzed  in  Chapter  4. 

Role  of  Zoom  Lens  at  Different  Ranges 

As  briefly  stated  above,  the  use  of  a  zoom  lens  can 
greatly  increase  the  flexibility  with  which  we  can  choose  a 
site  for  the  camera  setup.   It  can  give  us  a  virtually 
unlimited  number  of  locations  from  which  the  image  is 
obtainable.   In  addition,  it  can  be  a  primary  tool  for 
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fighting  the  visual  distortion  described  in  the  previous 
section.   With  a  zoom  or  telephoto  lens,  the  observation  can 
be  made  from  a  greater  distance,  thus  removing  the 
distortion,  and  will  still  capture  the  entire  image  desired. 

These  advantage  of  the  zoom  lens  are  not  without  their 
complications,  however.   While  the  flexibility  of  the  zoom 
lens  is  ideal  for  the  construction  site,  where  the  observer 
may  be  required  to  change  position  regularly,  it  also 
introduces  another,  imprecisely  controllable  variable.   In 
the  common  commercially  available  video  camera,  the  focal 
length  markings  on  the  lens  are  vague  at  best.   There  are  no 
"click  stops"  which  would  indicate  precise  settings  for 
focal  length.   The  best  the  user  could  hope  for  is  that  the 
highest  and  lowest  settings  are  reasonably  reliable;  judging 
by  the  performance  of  the  model  used  for  data  gathering, 
this  is  a  poor  assumption.   The  zoom  mechanism  gives  a  very 
poor  response  to  manual  adjustment. 

For  these  reasons,  it  is  recommended  that  great  effort 
be  made  to  establish  fixed  locations  from  which  to  gather 
data,  and  hence  used  one  of  several  fixed  focal  length 
lenses.   If  the  advantages  of  the  zoom  lens  are  of  great 
need  to  the  user,  the  zoom  must  have  precise  metering  for 
rapid  and  correct  determination  of  focal  length. 


42 


CHAPTER  4 
PROCESSING  OF  DATA 

Relationship  of  Pixels  to  Object  Size 
The  computer  does  not  report  the  number  of  square  feet. 
It  reports  the  number  of  pixels  in  the  digitized  image. 
Users  must  understand  that  due  to  the  conditions  on  the 
site,  it  may  be  necessary  to  obtain  data  from  different 
positions.   Each  camera  position  will  have  its  own  unique 
relationship  to  the  building,  which  will  determine  what  the 
number  of  pixels  equates  to  relative  to  the  structure  being 
measured.   Each  unique  pixel  ratio  must  be  provided  to  the 
computer  for  analysis.   There  are  several  alternatives  to 
accomplish  this.   These  alternatives  will  require  the 
creation  or  acquisition  of  software  with  different 
capabilities  than  the  system  employed  in  this  study. 

Manual  Input 

Manual  input  is  probably  the  simplest  method  for 
informing  the  computer  of  the  variables  applicable  to  indi- 
vidual sets  of  acquired  images.   This  requires  that  speci- 
fics relative  to  the  controlling  variables  be  provided  at 
the  prompting  of  the  system. 

First,  some  object  of  fixed  dimension  within  the  frame 
must  be  identified,  from  which  the  computer  can  derive  the 
ratio  of  pixels  to  feet  at  a  given  point  in  the  image.   This 
identification  can  be  accomplished  by  using  the  cursor  keys 
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or  a  "mouse"  to  establish  the  endpoints  of  the  object,  which 
could  be  as  simple  as  a  surveyor's  range  pole.   This  stan- 
dard must  be  at  the  same  distance  as  the  object  of  the 
study,  or  both  distances  must  be  known  and  provided. 

In  addition,  it  will  be  necessary  to  provide  such 
variables  as  distance  from  the  object,  focal  length  of  the 
lens,  the  vertical  (elevation)  angle  of  the  lens,  and  the 
camera  relationship  to  the  centerline  of  object. 

The  necessity  of  the  first  variable  (reference  object) 
is  straightforward,  but  qualified  in  that  the  ratio  deter- 
mined is  in  fact  accurate  only  at  that  point  on  the  screen. 
Deviation  from  the  determined  ratio  increases  with  the 
distance  from  that  point. 

As  discussed  in  Chapter  3,  the.  angular  field  of  view  of 
a  camera  is  fixed,  but  the  actual  dimensions  covered  depend 
directly  upon  the  distance  of  the  object  from  the  lens.   As 
the  distance  increases,  the  field  of  view  grows,  so  an 
object  of  fixed  dimension  appears  to  shrink.   This  explains 
why  a  tall  building  gets  narrower  at  the  top. 

It  is  necessary  to  quantify  this  effect  if  a  true 
output  is  to  be  obtained  from  the  system.   The  ratio  of 
pixels  to  square  feet  is  variable,  and  dependent  upon  the 
location  of  any  particular  point  on  the  screen.   This  is 
controlled  by  the  six-dimensional  relationship  of  the  camera 
to  the  object  (x,  y,  z,  inclination,  horizontal  angle,  and 
tilt).   This  relationship  is  illustrated  in  Figure  4.1. 
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FIGURE  4. 1 
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The  first  step  in  defining  the  equations  which  will 
determine  the  ratio  is  to  define  the  variables  to  be  dis- 
cussed: 

Di.-,  =  horizontal  distance  from  camera  to  object 

ho  =  height  of  reference  object 

h   =  height  of  object  with  respect  to  camera 

Do  =  distance  to  reference  object 

D   =  distance  to  any  point  in  the  image 

Ro  =  screen  row  of  the  fixed  dimension  given 

(x^    =   the  vertical  angle  of  the  camera 

a-^    =   the  vertical  angular  field  of  view 

Co  =  screen  column  bf  the  centerline  of  fixed  dimension 

an  =  horizontal  angle  of  the  camera  with  respect  to 

perpendicular  from  building  centerline 
a^•,  =  the  horizontal  angular  field  of  view 
<5   =  angular  separation  from  reference  point 

These  are  illustrated  in  Figures  4.2,  4.3,  and  4.4. 

The  ratio  of  the  sizes  of  the  fields  of  view  at  any  two 
points  moving  up  or  down  a  structure  will  determine  the 
ratio  of  the  pixel  "sizes"  at  those  levels. 

In  Figure  4.2,  the  dimensions  designated  as  Ho  and  H 
are  physically  the  same.   In  this  example,  however,  the 
distances  to  those  two  points  are  different,  designated  by 
Do  and  D  respectively.   These  values,  shown  in  Figure  4.3, 
are  : 


Do  =  \rK^^^^~r~h^ 

(4-1) 


o=^[ 


D^-,-^    +    (h+dh)=^ 

But  h    =   Di,  tana^o        and  dh    =   Dk,  ( tan(<Xs^o+<5  ) -tana^o  ) ; 

therefore,    by    substitution. 
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(4-2) 


Do  =  Dn  Jl  +  tan^-a>^o  =  Dh.seca^o 
and      D  =  D^^      1  +  tan^-(a>^o  +  (5 )  =  DMsec(a^o  +   6). 
Since  the  field  of  view  is  represented  by 

FOV  =  2  D  tan(a^./2)  (4-3) 

the  ratio  of  the  two  fields  of  view  shown  above  is 


FOV     2  (DKsec(a^o  +  <5 )  )  tan(a^./2) 

FOV  ratio  =  =  (4-4) 

FOVo     2  (Dh.seca^o)  tan(a^./2) 


which  simplifies  to: 


sec(av.o  +  6)  cos{<x^c) 

FOV  ratio  =  =  (4-6) 

sec(avo)        cos(a^o  +  6) 


This  field  of  view  ratio,  which  will  also  be  termed  the 
Pixel  Ratio,  will  act  as  a  multiplier.   By  equating  the 
vertical  screen  dimension  to  a  visual  angle,  the  angular 
difference  between  the  various  points  of  interest,  <5 ,  can  be 
expressed  in  terms  of  the  screen  row  of  the  latter  (D). 

The  system  specifications  will  determine  whether  the 
camera  image  will  fill  or  fit  on  the  screen.   Vertical  and 
horizontal  directions  must  be  addressed  separately.   PCEYE 
clips  the  image  in  both  directions.   For  the  purpose  of  this 
derivation,  o-^    has  been  chosen  as  the  vertical  angle  of  equiva- 
lence, as  if  the  vertical  dimension  of  the  image  has  been 
made  to  fill  the  screen  vertically. 
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Ro  -  R 

<5  =  G^  (4-6) 

200 


The  field  of  view  ratio  now  becomes  the  Pixel  Ratio  by 
substitution  for  <3  in  the  FOV  ratio  equation  (4-5). 


cos(aN^o) 

PR^  =  PIXEL  RATIO  =  (4-7) 

Ro  -  R 

cos(a^o  +   o-^) 

200 


The  V  subscript  indicates  that  this  pixel  ratio  adjusts  for 
the  distortion  due  to  the  differing  distance  in  the  vertical 
direction.   A  similar  derivation,  for  the  horizontal  direc- 
tion, is  contained  in  Appendix  B.   The  variables  involved 
are  illustrated  in  Figure  4.4.   Co  represents  the  centerline 
of  the  reference  known  dimension.   The  resultant  horizontal 
pixel  ratio  is: 

cos  (an.o ) 

PRm  =  (4-8) 

C  -  Co 

cos(aMo  +   a^  ) 

320 

The  total  pixel  ratio,  the  product  of  PR^  and  PRi-,,  then 

gives  the  comprehensive  multiplier  that  can  be  applied  to 

any  point  on  the  screen  and  determine  the  physical  area  to 

which  it  equates,  based  on  its  (Row, Column)  coordinates.   In 

order  to  put  PRr  in  terms  of  the  given  parameters,  it  must 

be  recognized  that: 
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Gsx  o^  10  0  -  Ro 

ttv^o  =  a^  +  -  Ro( )  =  a^  +   Gv    (4-9) 

2  200  200 


a^        a^n  Co  -  160 

and   a^o  =  ttM  -  +  Co( )  =  tth-,  +  an    (4-10) 

2        320  320 


Substituting  into  the  PR>^  and  PRi-,  equations,  PR-r  becomes  a 

function  of  Row  and  Column: 

100  -  Ro  Co  -  160 

cos(a>,  +  Qv)  cos(ai-,  +  Ov^) 

200  320 

PRx  =  (4-11) 

100  -  R  C  -  160 

cos(as^  +  a^)  cos(ah-,  +  an,) 

200  320 

In  this  section,  it  has  been  demonstrated  that  the  com- 
puter may  calculate  areas  by  counting  the  necessary  pixels 
and  applying  their  unique  multipliers  to  obtain  a  measure  of 
the  actual  area.   That  multiplier  can  vary  widely  with  the 
conditions  and  the  values  of  the  multiple  variables. 
Appendix  C  contains  a  short  computer  program  and  several 
runs  showing  the  range  of  multipliers  for  different 
scenarios  using  the  equipment  available.   Searching  a  range 
of  zero  to  forty-five  degrees  for  a^  and  a^,,  the  maximum  PRr 
was  2.6818175  and  the  minimum  0.23047097.   It  should  be 
noted  that  this  is  the  worst  case  scenario,  and  resulted  at 
both  values  equal  to  forty-five  degrees,  and  the  Lens  at  its 
minimum  focal  length  (maximum  a,,,  a^)  . 
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"Automatic"  Input 

The  previous  section  discussed  the  steps  to  be  taken  by 
the  computer,  given  a  required  input  by  the  user.   It  would 
be  preferable  if  the  user  did  not  have  to  repeatedly  take 
manual  measurements,  and  then  report  them  to  the  computer. 
There  are  several  methods  that  may  preclude  this  necessity 
by  automating  part  or  all  of  this  particular  step  in  the 
data  collection  process.   These  include  incorporation  of: 
radio  frequency  (RF)  tr iangulation  for  camera  position;  a 
fixed  three  dimensional  target;  and  utilization  of  "total 
station"  surveying  technology. 

RF  Tr iangulation.   Simple  methods  of  tr iangulation  can 
be  used  to  pinpoint  the  location  (x,  y,  z)  of  the  camera. 
This  data  could  be  stored  directly  on  the  videotape  and  read 
by  the  computer  at  the  beginning  of  any  session. 

Three  Dimensional  Target.   A  three  dimensional  target 
of  known  size,  location,  and  features  would  serve  the 
purpose  of  providing  a  sufficiently  intelligent  and  discri- 
minating system  with  all  the  necessary  variables  described 
above  except  for  one.   It  would  still  be  necessary  for  the 
range  or  the  focal  length  of  the  camera  to  be  provided 
separately.   This  separate  data  input  may  also  be  automated 
through  electronic  reporting  of  focal  length  or  electronic 
rangef inding . 

"Total  Station"  Technology.   A  total  station  is  an 
electronic  surveying  instrument  that  has  the  capability  to 
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electronically  record  all  measurements  taken.   The  (x,y,z) 
coordinates  of  the  station  can  be  determined  by  initiating 
any  equipment  setup  with  readings  taken  of  points  of  known 
coordinates  and  elevation.   The  horizontal  and  vertical 
angles  used  to  sight  these  points  act  as  a  base  of  reference 
for  all  future  observations.   The  distance  to  the  object  is 
determined  by  the  total  station  using  infrared  rangef inding . 

It  is  feasible  for  our  camera  equipment  to  be  mounted  in 
concert  with  the  total  station  so  that  the  two  move  as  one, 
the  camera  being  a  known  distance  above  the  total  station 
reference . 

How  Does  Productivity  Come  From  This? 

At  this  point  it  is  necessary  to  review  and  address  the 
origin  of  this  specific  research  project.   The  purpose  in- 
tended for  the  introduction  of  computer  vision  to  construc- 
tion was  for  automated  collection  and  processing  of 
productivity  data. 

We  now  know  that  the  information  necessary  can  be 
collected  on  videotape  and  fed  to  a  computer  that  should  be 

* 

able  to  determine  the  actual  dimensions  of  the  object  being 
observed.   This  necessarily  includes  the  capability  to  have 
the  video  input  directly  to  a  computer  on  sight.   This  has 
the  advantage  of  providing  more  immediate  data  for  use. 

Determination  of  straight  units-per-hour  productivity 
is  a  simple  matter  of  the  computer  making  observations  at 
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determinable  intervals  and  comparing,  or  subtracting,  the 
characteristics  of  subsequent  images.   Again,  this  can  be 
done  real-time  on  the  job  site,  or  back  in  the  office  at 
normal  speed  or  fast-forward. 

It  is  possible  that  with  the  proper  feedback  and  soft- 
ware, the  computer  will  be  able  to  carry  out  the  operation 
with  little  or  no  supervision.   The  overriding  constraints 
to  this  possibility  at  this  time  is  the  limitations  on  the 
computer  vision  system  to  properly  discern  between  different 
areas  that  may  provide  similar  digital  patterns.   This  was 
observed  in  several  instances  among  the  images  obtained  for 
this  project. 

For  the  foreseeable  future,  until  the  systems  available 
for  use  are  more  refined,  and  the  software  is  developed, 
these  systems  will  require  constant  involvement  and  input  by 
human  operators. 
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CHAPTER  5 
SUMMARY  AND  CONCLUSIONS 

Summary 

Initiative 

The  driving  motivation  behind  the  entrance  into  this 
field  was  the  extremely  labor  intensive  methods  required  for 
productivity  data  collection.   The  current  methods  involve 
manual  collection  of  data  which  require  extensive  review  and 
correlation  before  meaningful  results  can  be  determined. 

A  system  is  desired  with  the  potential  to  provide 
real-time  productivity  analysis,  whereby  the  supervision  on 
the  job  site  can  more  rapidly  respond  to  productivity 
problems,  and  eliminate  those  periods  of  low  productivity 
which  cannot  be  addressed  due  to  the  gap  between  performance 
and  analysis  of  data. 

The  potential  for  automation  of  this  process  can  be 
seen  in  light  of  the  advances  in  the  field  of  computer 
science  and,  more  specifically,  pattern  recognition.   This 
potential  has  accomplished  much  in  the  manufacturing  field 
related  to  high  speed  quality  control  inspection  of  products 
on  the  assembly  line. 
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Approach 

The  basic  approach  taken  in  this  research  project  was 
to  highlight  those  features  unique  to  the  construction  site, 
and  to  explore  the  potential  barriers  in  physical  character- 
istics and  technology.   The  predominant  factors  under 
consideration  are  the  matters  of  lighting,  distance,  and 
visibility  of  object. 

The  initial  step  was  to  become  familiar  with  the  con- 
cepts of  computer  vision  and  the  basic  equipment  available 
for  use. 

Observations  were  then  taken  at  several  construction 
sites.   However,  it  was  found  that  the  dynamic  nature  of  the 
facilities  under  construction  introduced  too  many  variables 
for  proper  analysis  at  this  time. 

After  these  initial  construction  site  attempts,  an 
existing  structure  was  settled  upon  for  the  remainder  of 
the  study.   This  existing  facility  provided  a  limited  number 
of  materials  and  a  fairly  unobscured  view,  thus  reducing  the 
variables  in  the  equation.   Observations  were  then  made  from 
three  separate  distances,  at  three  times  of  day,  at  two 
focal  lengths,  to  determine  and  illustrate  the  effects  of 
the  differences  in  lighting  (intensity  and  orientation)  and 
distance  (under  varied  magnification). 
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Major  Problems  Encountered 

There  were  many  problems  encountered  during  the  process 
of  applying  the  system  to  the  building  environment.  Tables 
5.1  and  5.2  contain  a  summary  of  the  physical  and  techno- 
logical barriers  encountered,  respectively.   The  information 
was  put  in  this  format  in  order  to  provide  the  reader  a 
succinct  and  complete  overview  of  the  perceived  problems. 
These  Tables  should  be  able  to  stand  alone. 
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TABLE  5. 1 
Physical  Barriers  Encountered 


Obstructions    The  existence  of  physical  obstructions 

throughout  the  printed  images  is  readily 
apparent.   These  include  trees  and  shrubs, 
signs,  and  lampposts. 

Suggestions     The  least  disruptive  method  for  removing 

these  obstacles  from  the  image  is  to  move 
inside  the  obstacles;  this  is  not  always 
possible. 

Physically  remove  the  obstacle.   In  most 
cases  this  may  be  infeasible,  in  the  case  of 
large  equipment,  or  unpopular,  as  for  trees. 

Develop  image  "subtraction"  routine  which 
can  recognize  an  obstacle  as  part  of  the 
"landscape"  and  can  filter  out  the  obstacle 
and  fill  in  the  remaining  gaps  by 
"painting"  in  the  missing  object. 

Shadows         Shadows  drastically  change  the  character- 
istics of  the  object  being  analyzed.   Deep 
shadows  caused*  by  bright  lighting  conditions 
are  opaque/black,  to  the  digitizing  system. 

Suggestions     Utilize  vision  system  with  greater  number  of 

grey  levels.   These  systems  more  discri- 
minately  assign  grey  levels  and  may  have 
the  depth  to  assign  non-black  values  to  the 
shadows . 

Fill  in  the  shadows  by  supplementing  natural 
lighting  with  artificial  lighting.   This 
will  alleviate  the  severe  underlighting  that 
causes  the  sharp  shadow  and  makes  the  image 
black  in  that  region. 

Incident        The  angle  of  the  incident  light,  directly 
Lighting        dependent  upon  the  time  of  day  and  year,  or 

the  location  of  the  artificial  lighting,  may 
cause  highly  reflective  surfaces  to  appear 
much  brighter  than  they  actually  are,  or  to 
appear  to  be  other  materials  altogether. 

(continued  on  next  page) 
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TABLE  5. l(continued) 


Suggestions      Incorporate  color  capabilities  into  the 

system  to  better  distinguish  individual 
fields  of  interest. 

Introduce  polarization  to  reduce  the 
intensity  of  light  reflecting  from  polished 
surfaces.   This  will  have  lesser  effect  on 
rough  surfaces  where  the  light  is  reflected 
more  randomly. 

Study  surface  treatment  characteristics  of 
construction  materials. 

Changing        Items  such  as  welding  shields,  scaffolding. 
Features        or  other  temporary  facilities  which  are 

moved  during  construction  will  randomly 
introduce  error  into  the  image. 

Suggestions     Manually  compensate  for  the  error. 

Develop  algorithms  which  can  recognize  such 
obstructions  and  essentially  subtract  them 
from  the  image,  filling  in  the  gaps  left 
behind  with  the  building. 
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TABLE  5.2 
Technological  Problems 


Optical         This  distortion  can  be  of  two  types:   the 
Distortion      narrowing  caused  by  the  expansion  of  the 

field  of  view  with  distance  (e.g.  on  a  tall 
building);  and  the  curving  found  at  the 
edges  of  an  image  caused  by  imprecision  on 
the  part  of  the  camera  lens. 

Suggestions     The  edge  distortion  is  a  direct  function  of 

the  construction  and  quality  of  the  camera 
lens;  therefore  this  error  can  be  eliminated 
by  use  of  higher  quality  equipment. 

The  narrowing  effect  of  distance  can  be 
corrected  by  use  of  special  lens  additions 
that  straighten  the  inclined  lines. 

The  narrowing  effect  can  be  compensated 
through  the  use  of  algorithms  which  define 
the  relative  values  of  pixels  at  different 
locations  on  the  screen. 

The  grossness  of  a  four  grey  level  system 
results  in  images  which  can  be  ill-defined 
and  can  exhibit  some  randomness  in  assign- 
ment of  grey  levels  to  regions  with 
comparable  light  intensities. 

Suggestions      Move  up  to  a  system  with  more  grey  levels. 

The  more  grey  levels  the  system  has,  the 
narrower  will  be  the  actual  range  of  light 
intensities  that  will  be  assigned  to  any  one 
level.   This  will  allow  flexibility  in 
sensitivity  definition  on  a  case  by  case 
basis.   Sixteen  or  sixty-four  grey  level 
systems  should  be  a  sufficient  next  step. 

Precise  Camera   When  the  camera  position  and  alignment 
Position  and     cannot  be  permanent  or  preserved  from  day  to 
Direction       day,  the  existing  equipment  does  not  have 

the  capability  for  precision  in  spatial 
positioning  (x,  y,  and  z)  or  in  alignment 
(elevation,  direction,  and  tilt).   The 
through-the-Iens  optics  are  inadequate  for 
these  purposes. 

(continued  on  next  page) 
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TABLE  5. 2 (continued) 


Suggestions      Customize  camera  equipment  by  mounting  on  a 

calibrated  base  on  a  fully  adjustable 
tripodx  and  attaching  additional  precision 
sights  for  referencing  established  points. 

Mount  camera  in  conjunction  with  existing 
"Total  Station"  technology  to  determine 
actual  position,  rather  than  attempt  to 
establish  a  desired  position,  and  translate 
the  new  images  to  the  old  position 
algor ithmically . 
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Conclusions 

PCSYE  as  a  Learning  Tool 

The  PCEYE  system  used  in  the  research  served  as  an 
excellent  tool  to  learn  the  basics  of  how  a  computer  vision 
system  works.   It  also  helped  to  spark  an  interest  in  what 
the  potential  for  such  a  system  might  be  through  showing 
where  its  shortcomings  were.   These  limitations  are, 
however,  what  keeps  it  from  being  a  feasible  system  for  end 
use  in  this  effort. 

The  primary  limitations  of  the  system  are  its  low  reso- 
lution, small  number  of  grey  levels,  and  incredibly  slow 
processing  speeds.   The  first  two  deficiencies  are  simple  to 
overcome,  as  systems  exist  with  higher  resolution  and  more 
grey  levels;  they  are  also  quite  a  bit  more  expensive.   The 
latter  is  as  much  a  function  of  the  hardware  (a  standard  PC) 
as  the  PCEYE  software.   Again,  this  can  be  overcome  by 
upgrading  to  currently  available  equipment,  at  much  greater 
cost. 

Adequacy  of  Video  Resolution 

The  resolution  of  the  video  images  obtained  were 
adequate  for  the  purpose  of  this  research,  but  for  actual 
practice  it  may  be  desirable  to  use  more  precise  equipment, 
particularly  in  the  areas  of  focusing  and  focal  length 
determination  in  a  zoom  application.   It  must  be  kept  in 
mind  that  the  video  camera  and  equipment  used  were  marketed 


63 

for  home  movies  for  human  observers,  and  are  not  expected  to 
produce  as  precise  an  image  as  we  desire  for  our  automated 
processing  methods. 

Feasibility 

The  successful  construction  of  a  system  that  automates 
the  collection  and  processing  of  visual  data  for  the  purpose 
of  productivity  analysis  is  the  goal.   This  will  require  the 
extended  attention  and  cooperation  of  experts  in  the  fields 
of  computer  science  (and  expert  systems),  pattern  recogni- 
tion, vision  systems,  and  construction  and  productivity. 

The  long  range  goals  of  this  research  are  feasible. 
This  should  be  qualified,  however,  by  saying  that  it  will 
not  be  done  overnight. 

Recommendations  for  Future  Direction 

There  are  several  areas  that  warrant  immediate  consi- 
deration as  the  next  steps  into  adapting  vision  systems  to 
the  construction  industry  are  considered.   The  overriding 
and  required  preliminary  step  to  any  of  these  items,  how- 
ever, is  the  specific  requirement  for  a  new  "level"  of 
equipment  to  be  acquired.   This  equipment  should  have  higher 
resolution,  higher  discrimination  of  light  intensities,  and 
higher  speed. 

The  first  of  these  "next  steps"  or  items  of  considera- 
tion is  the  effect  or  contribution  that  the  different  colors 
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and  finishes  of  construction  materials  make  to  the  produc- 
tion of  an  image.   As  shown  in  Chapter  3  and  Appendix  A, 
similar  patterns  in  the  processed  image  can  be  produced  by 
different  materials.   By  studying  the  effects  of  colors  and 
finishes,  in  conjunction  with  incident  light  characteris- 
tics, -the  error  involved  in  equating  different  materials  can 
be  recognized  and  controlled. 

The  second  item  which  bears  investigation  is  the 
problem  encountered  in  trying  to  see  into  the  shadows.   It 
is  hoped  that  a  more  discerning  grey  scale  breakdown  will 
alleviate  this  problem. 

The  third  item  that  was  not  covered  with  any  depth  to 
this  point  is  the  effects  of  physical  obstructions  on  the 
construction  site,  particularly  those,  such  as  scaffolding, 
plastic  sheeting,  and  other  temporary  structures  whose 
position  is  constantly  or  periodically  changing.   Immovable 
objects,  such  as  trees,  and  other  buildings,  may  be  compen- 
sated throughout  the  study,  but  when  the  obstruction  is 
mobile,  the  compensation  can  no  longer  be  manual  due  to  the 
time  involved.   The  comprehensive  effect  on  the  image  has 
not  been  determined  at  this  time. 

The  fourth,  and  probably  biggest,  step  for  consi- 
deration is  the  introduction  of. this  system  to  interior 
areas  of  the  construction  site.   To  date  this  work  has  been 
directed  at  inspecting  exterior  walls.   It  is  another  matter 
altogether  for  a  system  to  inspect  an  interior  setting  where 
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distances  are  much  smaller  in  proportion  to  the  items  to  be 
observed,  requiring  a  wider  angle  lens  (extremely  short 
focal  length) .   In  addition,  the  camera  cannot  be  stationary 
and  still  obtain  the  necessary  images,  but  its  positions 
must  still  be  determinable  for  subsequent  images  to  be 
compared. 
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APPENDIX  A 
HARD  COPY  PCEYE  IMAGES 
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CLOSE-UP  OF  ATHERTON  HOTEL 
AFTER  SHEATHING  INSTALLED 
FIGURE  A. 2 
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CLOSE-UP  OF  CENTRE  COMMUNITY  HOSPITAL 
HIGHLIGHTING  SHADOWLINE 
FIGURE  A. 3 
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CLOSE-UP  OF  CENTRE  COMMUNITY  HOSPITAL 
HIGHLIGHTING  SHADOWLINE 
FIGURE  A. 4 
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CLOSE-UP    OF    CENTRE    COMMUNITY    HOSPITAL 
HIGHLIGHTING    SHADOWLINE 
FIGURE    A. 5 
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MORNING    CLOSE-UP    OF    CENTRE   COMMUNITY    HOSPITAL. 
NOTE   CURVED    EDGES    AT    TOP    AND    BOTTOM    OF    IMAGE. 

FIGURE    A. 6 
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NOON    CLOSE-UP    OF   CENTRE   COMMUNITY   HOSPITAL. 
NOTE   CURVED    EDGES   AT    TOP    AND    BOTTOM    OF    IMAGE. 

FIGURE    A. 7 
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AFTERNOON   CLOSE-UP   OF   CENTRE   COMMUNITY    HOSPITAL. 
NOTE   CURVED    EDGES    AT    TOP    AND    BOTTOM    OF    IMAGE. 

FIGURE    A. 8 
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CENTRE  COMMUNITY  HOSPITAL 
9  A.M.^  300  FEET,  FULL  ZOOM 
FIGURE  A. 9 
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CENTRE  COMMUNITY  HOSPITAL 

NOON,  300  FEET,  FULL  ZOOM 

FIGURE  A. 10 
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CENTRE  COMMUNITY  HOSPITAL 

3  P.M.,  300  FEET,  FULL  ZOOM 

FIGURE  A. 11 
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CENTRE   COMMUNITY   HOSPITAL 

9    A.M.,    200    FEET,    FULL   WIDE 

FIGURE    A. 12 
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CENTRE  COMMUNITY  HOSPITAL 

NOON,  200  FEET,  FULL  WIDE 

FIGURE  A. 13 
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CENTRE  COMMUNITY  HOSPITAL 

3  P.M.,  200  FEET,  FULL  WIDE 

FIGURE  A. 14 
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CENTRE   COMMUNITY   HOSPITAL 

9    A.M.,     200    FEET,    FULL   ZOOM 

FIGURE   A. 15 
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CENTRE  COMMUNITY  HOSPITAL 

NOON^  200  FEET^  FULL  ZOOM 

FIGURE  A. 16 
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CENTRE  COMMUNITY  HOSPITAL 

3  P.M.,  200  FEET,  FULL  ZOOM 

FIGURE  A. 17 
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CENTRE 
9  A.M., 


COMMUNITY  HOSPITAL 
100  FEET,  FULL  WIDE 
FIGURE  A. 18 
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CENTRE  COMMUNITY  HOSPITAL 

NOON,  100  FEET,  FULL  WIDE 

FIGURE  A. 19 
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CENTRE 
3  P.M.^ 


COMMUNITY  HOSPITAL 
100  FEET,  FULL  WIDE 
FIGURE  A. 20 
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APPENDIX    B 
DERIVATION    OF    PR^ 
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This  appendix  contains  the  derivation  of  the  horizontal 

pixel  ratio  used  in  Chapter  4.   The  variables  used  in  the 

derivation  are  as  follows: 

Df,  =  horizontal  distance  from  camera  to  object 

b   =  offset  distance  to  reference  object  (perp.  to  Dn ) 

Do  =  distance  to  reference  object 

D   =  distance  to  any  point  in  the  image 

Co  =  screen  column  of  the  centerline  of  fixed  dimension 

a.-,  =  horizontal  angle  of  the  camera  with  respect  to 

perpendicular  from  building  centerline 
o^■,    =  the  horizontal  angular  field  of  view 
6      =   angular  separation  from  reference  point 

All  variables  used  in  this  derivation  are  illustrated  in 

Figure  4.4. 


Do  =  Nj(D^n=-  +  b^"^) 
D  = 


\l 


(B-1) 


(D^-  +  (b+db)-^) 


But   b    =   DH-,tanaKo   and   db   =   Dn  ( tan(a^,o+6  ) -tana^^o) ;    therefore, 
by    substitution: 


and 


Do    =  D»-,  nJ  ( 1    +    tan■^■a^,o)    =   Df-,secai.-,o 
D    =   Di.  J  ( 1    +    tan^(ah-,o    +   6 )  )    =   Dt^secCa^o    +   6) 


(B-2) 


Since  the  vertical  field  of  view  is  represented  by 

FOV  =  2  D  tan(a^/2)  (B-3) 

the  ratio  of  the  two  fields  of  view  shown  above  is 


FOV  ratio  = 


FOV     2(D^,sec(aMo  +  (5 )  )  tan(a^/2  ) 


FOVo     2(DHseca..o)  tan(a^/2) 


(B-4) 
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which  simplifies  to; 


sec(aH..o  +  6)  cos(aH,o) 

FOV  ratio  =  =  (B-5) 

sec(at^no)      cosCa^.o  +  6) 


For  the  purpose  of  this  derivation,  a^  has  been  chosen 
as  the  horizontal  angle  of  equivalence,  so  that  the  angular 
separation  equates  on  the  screen  to: 


C  -  Co 

<5  =  a^-,  (B-6) 

360 


The  field  of  view  ratio  now  becomes  the  Pixel  Ratio  by 

substitution  for  6    in  the  FOV  ratio  equation  (B-5). 

cos(ap,o) 

PRm  =  PIXEL  RATIO  =  (B-7) 

C  -  Co 

cosCat^o  +   at.) 

360 

The  h  subscript  indicates  that  this  pixel  ratio  adjusts 

for  the  distortion  due  to  the  differing  distance  in  the 

horizontal  direction. 
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APPENDIX  C 
BASIC  PROGRAMS  TO  DETERMINE  PIXEL  RATIOS 
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100 

110 

120 

130 

140 

160 

160 

170 

180 

190 

200 

210 

220 

230 

240 

250 

260 

270 

280 

290 

300 

310 

320 

330 

340 

350 

360 

370 

380 

390 

400 

410 

420 

430 

440 

460 

460 

470 

480 


I  *  « 


*************«***«»**•*********««««« 


I  «  « 
I  *  * 

•  *  iir 


PROGRAM    TO    DETERMINE   MAXIMUM    AND    MINIMUM    PRV 

*«***«*«**«*AA*A**«A****«**llt)lr«1lr««r**««ilr**«A*tlrX««****««A*««r***«*ilrA*i 

PI    =    4*ATN(1) 

PRINT    "ALPHAV"; "    SIGMAV", "PRSUBl", "PRSUB2", "PRSUB3", "PRSUB4" 
PRINT 
PRMAX    =    0 
PRMIN    =    100 

•45    DEGREES    IS    THE    MAXIMUM    VERTICAL   ANGLE   CONSIDERED 
FOR    ALPHAV    =    0    TO    45    STEP    5 
ALPHAVEE    =   ALPHAV    *    PI/180 
PRINT. ALPHAV 
•CAMERA    USED   FOR    RESEARCH    HAS   VERTICAL   FOV   OF    5-33    DEGREES 
FOR    SIGMAV    =   5    TO    33    STEP    4 

SIGMAVEE    =    SIGMAV    *    PI/180 

•CASE    1:       REFERENCE    AT    TOP    OF    SCREEN,     POINT    OF    INTEREST    AT    BOTTOM. 

PRSUBl    =   COS(ALPHAVEE    +    SIGMAVEE* . 5 ) /COS( ALPHAVEE    +    SIGMAVEE* (-. 5 ) 

•CASE    2:       REFERENCE    AT   TOP    OF    SCREEN,    POINT   OF    INTEREST    AT    MIDDLE. 

PRSUB2    =   COS( ALPHAVEE    +    SIGMAVEE* . 5 ) /COS( ALPHAVEE ) 

•CASE    3:       REFERENCE    AT    MIDDLE   OF    SCREEN,    POINT   OF    INTEREST   AT    TOP. 

PRSUB3    =   COS( ALPHAVEE) /COS( ALPHAVEE    +    SIGMAVEE 

•CASE    4:       REFERENCE    AT    MIDDLE    OF    SCREEN,    POINT 

PRSUB4    =   COS( ALPHAVEE) /COS( ALPHAVEE    +    SIGMAVEE 


*     .5) 

OF    INTEREST 

*(-.5) ) 


AT    BOTT 


PRINT    SPC( 8 ) ; SIGMAV, PRSUBl, PRSUB2, PRSUB3, PRSUB4 


IF  PRMAX  <  PRSUBl  THEN  PRMAX  =  PRSUBl 
IF  PRMAX  <  PRSUB2  THEN  PRMAX  =  PRSUB2 
IF  PRMAX  <  PRSUB3  THEN  PRMAX  =  PRSUB3: 
IF  PRMAX  <  PRSUB4  THEN  PRMAX  =  PRSUB4 : 
IF  PRMIN  >  PRSUBl  THEN  PRMIN  =  PRSUBl: 
IF  PRMIN  >  PRSUB2  THEN  PRMIN  =  PRSUB2: 
IF  PRMIN  >  PRSUB3  THEN  PRMIN  =  PRSUB3: 
IF  PRMIN  >  PRSUB4  THEN  PRMIN  =  PRSUB4 ; 
NEXT    SIGMAV      ' 

NEXT    ALPHAV 

END 


MAX=SIGMAV 

MAX=SIGMAV 

MAX=SIGMAV; 

MAX=SIGMAV: 

MIN=SIGMAV; 

MIN=SIGMAV; 

MIN=SIGMAV; 


SUB=1 
SUB  =  2 
SUB  =  3 
SUB=4 
SU  =  1 
SU  =  2 
SU  =  3 


MIN=SIGMAV:SU=4 
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17 
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.9629531        ] 
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21 
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.9511219       J 
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PROGRAM  TO  DETERMINE  MAXIMUM  AND  MINIMUM  PRH 


«  « 


SIGMAH", "PRSUBl", "PRSUB2", "PRSUB3", "PRSUB4" 


BE  THE  MAXIMUM  CONSIDERED 


PI  =  4*ATN(1) 
PRINT  "ALPHAH"; 
PRINT 
PRMAX  =  0 
PRMIN  =  100 

•45  DEGREES  OFF  CENTER  WOULD 
FOR  ALPHAH  =  0  TO  45  STEP  6 
ALPHAHEE  =  ALPHAH  *  PI/ 180 
PRINT  ALPHAH 

•THE  CAMERA  USED  FOR  THIS 

FOR  SIGMAH  =  8  TO  44  STEP  4 

SIGMAHEE  =  SIGMAH  *  PI/180 

REFERENCE  AT  FAR  RIGHT, 

COS( ALPHAHEE  +  SIGMAHEE* 

REFERENCE  AT  FAR  RIGHT, 

COSCALPHAHEE  +  SIGMAHEE* 


RESEARCH  HAD  H0RI2  FOV  OF  8-44  DEGREES 


•CASE  1: 
PRSUBl  = 
•CASE  2: 
PRSUB2  = 
'CASE  3: 
PRSUB3  = 
•CASE  4: 
PRSUB4  = 


POINT  OF  INTEREST 
6) /COS( ALPHAHEE  + 
POINT  OF  INTEREST 
5) /COS (ALPHAHEE) 


AT  FAR  LEFT. 
SIGMAHEE* (-.5 
AT  CENTER. 


REFERENCE  AT  CENTER,  POINT  OF  INTEREST 

COS(ALPHAHEE)/COS(ALPHAHEE  +  SIGMAHEE  * 

REFERENCE  AT  CENTER,  POINT  OF  INTEREST 


AT  FAR  RIGHT, 

.5) 

AT  FAR  LEFT. 


COS (ALPHAHEE) /COS (ALPHAHEE  +  SIGMAHEE  *(-.5)) 


PRINT  SPC ( 8 ) ; SIGMAH, PRSUBl , PRSUB2, PRSUB3 , PRSUB4 


IF  PRMAX  <  PRSUBl 
IF  PRMAX  <  PRSUB2 
IF  PRMAX  <  PRSUB3 
IF  PRMAX  <  PRSUB4 
IF  PRMIN  >  PRSUBl 
IF  PRMIN  >  PRSUB2 
IF  PRMIN  >  PRSUB3 
IF  PRMIN  >  PRSUB4 
NEXT  SIGMAH 

PRMAX=0 

PRMIN=100 

MAX=0 

MIN=0 

SUB=0 

SU=0 

NEXT  ALPHAH 

END 


THEN 
THEN 
THEN 
THEN 
THEN 
THEN 
THEN 
THEN 


PRMAX 
PRMAX 
PRMAX 
PRMAX 
PRMIN 
PRMIN 
PRMIN 
PRMIN 


PRSUBl 
PRSUB2 
PRSUB3 
PRSUB4 
PRSUBl 
PRSUB2 
PRSUB3 
PRSUB4 


:MAX=SIGMAH: 
;MAX=SIGMAH: 
MAX=SIGMAH; 
MAX=SIGMAH: 
MIN=SIGMAH: 
MIN=SIGMAH; 
MIN=SIGMAH: 
MIN=SIGMAH; 


SUB  =  1 
SUB  =  2 
SUB  =  3 
SUB=4 
SU  =  1 
SU  =  2 
SU  =  3 
SU  =  4 
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